Universal Dependencies: A cross-linguistic typology

Authors

  • Marie-Catherine de Marneffe
  • Timothy Dozat
  • Natalia Silveira
  • Katri Haverinen
  • Filip Ginter
  • Joakim Nivre
  • Christopher D. Manning
Abstract

Revisiting the now de facto standard Stanford dependency representation, we propose an improved taxonomy to capture grammatical relations across languages, including morphologically rich ones. We suggest a two-layered taxonomy: a set of broadly attested universal grammatical relations, to which language-specific relations can be added. We emphasize the lexicalist stance of the Stanford Dependencies, which leads to a particular, partially new treatment of compounding, prepositions, and morphology. We show how existing dependency schemes for several languages map onto the universal taxonomy proposed here and close with consideration of practical implications of dependency representation choices for NLP applications, in particular parsing. This paper is a minor revision of our LREC 2014 paper “Universal Stanford dependencies: A cross-linguistic typology”. The content is largely identical, but the taxonomy of relations has been revised to be consistent with the final Universal Dependencies (UD) taxonomy for version 1.0 described at http://universaldependencies.github.io/docs/. Version of 2015-11-12.
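The two-layered taxonomy can be pictured as a relation label with a universal part and an optional language-specific part. The following is a minimal sketch, assuming the `universal:subtype` colon convention seen in UD treebank labels such as `nmod:poss`; the names `Relation` and `split_relation` are hypothetical and do not come from the paper.

```python
from typing import NamedTuple, Optional


class Relation(NamedTuple):
    """A dependency relation split into its two layers (hypothetical type)."""
    universal: str            # broadly attested universal relation, e.g. "nmod"
    subtype: Optional[str]    # language-specific refinement, e.g. "poss", or None


def split_relation(label: str) -> Relation:
    """Split a label like 'nmod:poss' at the first colon.

    Assumes the colon-separated convention; a label without a colon
    is treated as a purely universal relation.
    """
    universal, _, subtype = label.partition(":")
    return Relation(universal, subtype or None)


if __name__ == "__main__":
    for label in ("nsubj", "nmod:poss", "acl:relcl"):
        print(label, "->", split_relation(label))
```

Keeping the universal part separate lets a tool compare treebanks on the shared layer while ignoring language-specific subtypes when they are not needed.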

Similar resources

Universal Dependencies: A Cross-Linguistic Perspective on Grammar and Lexicon

Universal Dependencies is an initiative to develop cross-linguistically consistent grammatical annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning and parsing research from a language typology perspective. It assumes a dependency-based approach to syntax and a lexicalist approach to morphology, which together entail that the funda...

Universal Stanford dependencies: A cross-linguistic typology

Revisiting the now de facto standard Stanford dependency representation, we propose an improved taxonomy to capture grammatical relations across languages, including morphologically rich ones. We suggest a two-layered taxonomy: a set of broadly attested universal grammatical relations, to which language-specific relations can be added. We emphasize the lexicalist stance of the Stanford Dependen...

Linguistic Typology meets Universal Dependencies

Current work on universal dependency schemes in NLP does not make reference to the extensive typological research on language universals, but could benefit since many principles are shared between the two enterprises. We propose a revision of the syntactic dependencies in the Universal Dependencies scheme (Nivre et al. [16, 17]) based on four principles derived from contemporary typological the...

MarsaGram: an excursion in the forests of parsing trees

The question of how to compare languages and more generally the domain of linguistic typology, relies on the study of different linguistic properties or phenomena. Classically, such a comparison is done semi-manually, for example by extracting information from databases such as the WALS. However, it remains difficult to identify precisely regular parameters, available for different languages, t...

Slavic Languages in Universal Dependencies

Universal Dependencies (UD) is a project that is developing crosslinguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning and linguistic research from a language typology perspective. It is a merger and extension of several previous efforts aimed at finding unified approaches to parts of speech, morph...

Publication date: 2015